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Abstract 

O . This paper introduces a new counting code. Its design was moti- 

vated by distributed video coding where, for decoding, error correction 
CN ■ methods are applied to improve predictions. Those error corrections 

sometimes fail which results in decoded values worse than the initial 



£T) \ prediction. Our code exploits the fact that bit errors are relatively un- 

likely events: more than a few bit errors in a decoded pixel value are 
rare. With a carefully designed counting code combined with a predic- 
tion those bit errors can be corrected and sometimes the original pixel 
' value recovered. The error correction improves significantly. Our new 

code not only maximizes the Hamming distance between adjacent (or 
"near-1") codewords but also between nearby (for example "near-2") 
codewords. This is why our code is significantly different from the 
5_i ■ well-known maximal counting sequences which have maximal average 

Hamming distance. Fortunately, the new counting code can be derived 
from Gray Codes for every code word length (i.e. bit depth). 



1 Introduction 

The idea behind of Distributed Video Coding was established in the 1970's 
by Slepian and Wolf [TT] and by Wyner and Ziv [14]. However, it was not 
before the year 2000 that a push was undertaken to actually establish work- 
ing distributed video coding systems. In the last few years many impressive 
and thought provoking research publications appeared on the subject ( cp. 
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[T5] for an extensive bibliography). To summarize all those findings would 
go beyond the scope of this paper. Currently, one of the biggest unsolved 
problems in distributed video coding is how to design error correction codes 
specifically for distributed video coding. 

The overall principle in a distributed video compression system is the fol- 
lowing: the decoder generates a preliminary prediction of the frame to be 
decoded. In some systems, this "preliminary prediction" is generated by mo- 
tion interpolation (PI), in some other systems by spatial interpolation ([6], 
[4]) or some hybrid interpolation schemes(|12|). Of course, those frame pre- 
dictions are erroneous. In distributed video coding the prediction errors are 
interpreted as it transmission errors and error correction methods known 
from channel coding are applied to improve the quality of those preliminary 
predictions. Of course, error correction methods are probabilistic methods 
and even though an improvement is possible for the majority of the pixels 
it cannot be avoided that some pixel values are "mis-corrected" in practice; 
the corrected pixel value is worse than the predicted pixel value. This is why 
most distributed video compression systems employ a "reconstruction step" 
which is often some form of thresholding. This reconstruction step decides 
whether the predicted pixel value or the error corrected pixel value is taken 
as final output pixel value. Resorting to the initially predicted pixel value 
rather than improving it is obviously undesirable which is why most authors 
compromise the simplicity of the encoder and choose more powerful and more 
complex error correction methods. 

In contrast to this we found that it is much more advantageous to remap 
the pixel values to a new bit representation such that we can take additional 
advantage of the prediction. It is obvious that: 

1. a small prediction error is more likely than a large prediction error, and 

2. a small number of mis-corrected bits in a pixel value is more likely than 
a larger number of mis-corrected bits. 

Our idea is simple but powerful; rearrange the bit representation of pixel 
values such that many error correction mistakes and a big prediction errors 
are unlikely to happen at the same time. This motivates a new binary code 
such that 

1. the Hamming distance between adjacent codewords (i.e. codewords 
representing pixel values differ by one) is high; 
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2. the Hamming distance between nearby codewords can be smaller than 
the Hamming distance between adjacent codewords, however, it is still 
large enough to facilitate later error correction; 

3. the Hamming distance between codewords representing very different 
pixel values can be small. 

Of course, for a compression application it is desirable not to add redundancy 
during encoding ( e.g. increasing the number of bits per pixel). This con- 
strains the new code to be a counting code. What a counting code is will be 
explained later. To our knowledge this kind of code has never been studied 
or used before. 

This paper is organized as follows: in Section [2] we introduce the nomencla- 
ture and some mathematical background of counting codes. We will explain 
first how to generate our new codes and study some of their important prop- 
erties. Of course, an example will be presented to illustrate our findings. In 
the next section, Section [3l we discuss how our new codes can be put into 
practice and what the advantage of our new code is. In the last section which 
is Section [H we outline what we believe would be valuable future work on 
this topic. 

2 New Counting Codes 

2.1 Nomenclature and previous work 

Let S(n\p) be a sequence of p distinct binary n-tuples. The sequence S is 
called a counting sequence of length n if all 2™ possible n-tuples are visited. 
In the sequel the binary n-tuples will be called codewords. Of course a code- 
word is a binary representation of a pixel value. We will index codewords in 
the sequence S from to p — 1 and denote the jth codeword by Xj, < j < p; 
the bit positions within a codeword Xj will be counted from 1 to n going from 
the right to the left. The rightmost bit is assumed to be the least significant 
bit. 

The average Hamming distance of a sequence is the average Hamming dis- 
tance between the p pairs of successive (or "near-1") codewords. It is obvious 
that a cyclic Gray sequence (a sequence of codewords where each pair of suc- 
cessive codewords, including the pair of the first and the last codeword, differ 
in only one bit) has an average Hamming distance of one for example. For 
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many applications sequences with maximum average Hamming distance are 
important. The following theorem establishes bounds for the Hamming dis- 
tance if the sequence is a counting sequence: 

Theorem 1 The average Hamming distance d^of a counting sequence S(n), 
n > 1 is bounded according to 



A proof for this theorem can be found in [T5] and [9]. A counting sequence 
with average Hamming distance n — | is called maximum counting sequence. 
An example for a maximal counting sequence is: 



It can be seen easily that in order to achieve a maximum average Hamming 
distance with a counting sequence the Hamming distance of near- 1 codewords 
must alternate between n and n — 1. The following was proven in [9]: 

Theorem 2 A maximum counting sequence exists for all n > 1 . 

However, for an application in distributed video coding maximizing the near- 
1 Hamming distance is not enough, e.g. near-2 distances have to be consid- 
ered as well. 

2.2 New Code Generation 

It took several attempts to find a code that satisfies the above design. Rather 
than subjecting the reader to all the mistakes we made and dead ends we 
headed down while developing our code we will simply present our current 
code, study its properties, and illustrate with an example how it is applied 
later in Section [3j 

In the previous section we saw that a maximum counting sequence pro- 
tects mainly neighboring codewords against bit errors. Within this section we 
will now outline how to derive a new code of length n from binary-reflected 
Gray codes of length n— 1 such that near-1 neighbors are sufficiently protected 
as well as near-2 neighbors. Of course, there are many ways to construct sim- 
ilar codes, however, we have decided to present one which is easy to derive. 




(1) 



000, 111, 001, 110, 011, 100, 010, 101. 
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First, we will quickly repeat how a Gray code of length n — 1 is generated by 
the well known binary reflection method: we start with an initial Gray code 
of length 1 which is (0, 1). Then, the initial Gray code is listed in reverse order 
which results in (1, 0). Next, the initial Gray code and the reverse listed code 
are concatenated. This results in (0,1,1,0). The length of each codeword 
now gets increased; the initial Gray code gets the prefix whereas the reverse 
listed code gets the prefix 1. This results in the code (00,01,11,10). this is 
the binary-reflected Gray code of length n = 2. This process is iterated until 
it results in a binary-reflected Gray code of length n — 1. Let this code be 
(xo, xi, x 2 , . . . , Xp-i). To derive our new code this binary-reflected Gray code 
of length n — 1 is listed in reverse order and concatenated to the original code 
again. This results in (x , x±, X2, ■ ■ ■ , x p -%, x p _i, x p - 2 , . . . , Xi, x ). Next, every 
second codeword £2.7+1 for j — 0, . . . , 2™ _1 — 1 is bitwise complemented. This 
results in the sequence (xq, Cxi, x 2 , Cx% . . . , Cx p _i, x p _i, Cx p _2, . . . , x%, Cxq) 
where Cx denotes the bitwise complement of x. Next, the code gets an 
alternating prefix throughout the sequence which results in 

(0x , lCxi, 0x 2 , lCx 3 . . . , lCxp_i, 0x p _i, lCx p _2, . . . , Oxi, lCx ). 

This is our new code of code length n. 

2.3 Example 

To make our example short we assume a bit depth of n = 4 only and show 
the generation of the counting code in Table [H 

2.4 New Code Characteristics 

Theorem 3 Let (xq, xi, X2, ■ ■ ■ , £ 2p -i) denote our new counting code. Then 

d H (X( k +l) mod 2p, Xk mod 2p) > U ~ 1, Vfc G N. 

Here, n denotes the codeword length of the new code whereas p is the number 
of the codewords of the Gray code from which the new code is derived. 
Proof: By construction it is 

(x , xi,..., x 2p -i) = (0y , ICyi, 0y 2 , lCy 3 lCy p - 1} 0y p -%, lCy p _ 2 , . . . , Oyi, lCy ), 

where the sequence (yo, ■ ■ ■ , y P -i) is a Gray code with n-1 bits. Thus, any 
two adjacent codewords differ in 1 bit because of the prefix and in at least 
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Table 1: Generation of the new counting code 



Start 


Step 1 


Step 2 


Step 3 


Binary reflective 








Gray code (n-l=3) 


Mirror 


Take complements 


Add prefixes 


000 


000 


000 


0000 


001 


001 


110 


1110 


011 


011 


011 


0011 


010 


010 


101 


1101 


110 


110 


110 


0110 


111 


111 


000 


1000 


101 


101 


101 


0101 


100 


100 


Oil 


1011 




100 


100 


0100 




101 


010 


1010 




111 


111 


0111 




110 


001 


1001 




010 


010 


0010 




Oil 


100 


1100 




001 


001 


0001 




000 


111 


1111 
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n — 2 bits because one of the Gray codes was complemented. □ 
This means that near-1 neighbors are protected by a sufficiently large Ham- 
ming distance. It is interesting to note that for some near-1 codewords the 
Hamming distance will be equal to n: d-n(xo, x p _i) = n and dfi(x p -i, x p ) = n. 
We are now looking into the Hamming distance of near-2 codewords: 

Theorem 4 



Proof: Obviously, the prefix (the first bit) of near-2 codewords is the same. 
Thus, only the remaining part will contribute to the Hamming distance. If 
k mod p 7^ p — 2 or k mod p p — 1 then all three suffixes of the codewords 
£fcmod2p,£(fc+i)mod2 P and X( fc+2 ) m od2p come from different Gray codes. The 
Hamming distance thus equals 2. If k mod p = p — 2 or k mod p = p — 1 
then either the first two or the last two remaining parts are derived from the 
same Gray code. The resulting Hamming distance is therefore only one. The 
same argument applies to k mod p = 2p — 2 and k mod p = 2p — 1. □ 
It is interesting to note that by construction the near-2 Hamming distance 
does not depend on the bit depth n. 

2.5 Continuation of Example 12.31 

In Table [2] we show the pixel values, the new codewords representing the 
pixel values, the near-1 and the near-2 Hamming distance. There, it can 
be seen that there are irregularities in both the near-1 and near-2 Hamming 
distance: for k = 7 and for k = 15 the near-1 Hamming distances increase 
whereas the near-2 Hamming distances decrease. Of course this is because 
of mirroring of the underlying Gray code. 

2.6 Discussion 

We have seen in the example above that there are irregularities in the Ham- 
ming distances. As mentioned above our new code was designed to be used 
in conjunction with a prediction. This is why the irregularity in the near-1 
and the near-2 Hamming distance caused by the last and the first codeword 
is irrelevant for applications like distributed video coding etc.; any useful 
prediction will not mistake k = for k = 2p — 1 and vice versa. 




k mod p = p — 2 V k mod p = p — 1 
else 
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Table 2: New counting code with its near-1 and near-2 Hamming distance. 



Pixel value 


Pixel value 


near-1 


near-2 


k 


represented in 


Hamming distance 


Hamming distance 




new counting code 


dn( x (k+l)mod 2p, Xkmod 2p) 


^w(^(fc+2)mod 2p, £fcmod 2p) 





0000 


3 


2 


1 


1110 


3 


2 


2 


0011 


3 


2 


3 


1101 


3 


2 


4 


0110 


3 


2 


5 


1000 


3 


2 


6 


0101 


3 


1 


7 


1011 


4 


1 


8 


0100 


3 


2 


9 


1010 


3 


2 


10 


0111 


3 


2 


11 


1001 


3 


2 


12 


0010 


3 


2 


13 


1100 


3 


2 


14 


0001 


3 


1 


15 


1111 


4 


1 
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The irregularity in the middle of the code, near k = p is more of a concern; of 
course not the increase in the near-1 Hamming distance but that the near-2 
Hamming distance drops to one only for any given bit depth n. The imme- 
diate question is whether those irregularities can be avoided. The following 
theorem is helpful in answering this question. 

Theorem 5 There exists no counting sequence {xo, X\, . . . , x p -i} such that 

d-ni^k mod p, £(fe+i) mod p) = I for all fceN, where I is even. 

Proof: If d-n(xkmod p, 2 ; (fc+i)mod p) is even then the codewords Xkmodp and 
£(fc+i)modp must have the same parity (e.g. even number of zeros). This 
means all codewords of the code must have the same parity ( all codewords 
must have an even number of zeros). This means that the code can never 
be a counting code (the codewords with an odd number of zeros are never 
visited). □ 

This was first presented in [9]. It means that a counting code with an even 
uniform near-1 Hamming distance is impossible. However, as it follows from 
our construction above it is very well possible to generate a code with an 
almost uniform even near-1 Hamming distance for any given bit depth n G N. 

3 Application 

Above we have mentioned that our new code was motivated by distributed 
video compression. Now, we will show exactly how to take advantage of it in 
applications such as distributed video coding for example. In most practical 
video applications the pixel bit depth equals 8 and the pixel values range 
from to 255. However, we assume that the bit depth is n = 4 and use the 
code we have generated above in Example 12.51 for the sake of brevity. The 
codewords are assumed to represent pixel values between and 15. 

3.1 Example 

Let us assume that the original pixel value is x = 7. Now, in our new code 
the value 7 is represented as the codeword x-j = (1011). This pixel value 
is predicted with a sufficiently accurate prediction method and the obtained 
prediction value (in its new binary representation) is error corrected. Let us 
assume this prediction is 8 which is represented by the code word (0100). 
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Given the low dynamic range in our example (bit depth equals only 4) a pre- 
diction mismatch of less than 2 can be considered as a poor prediction, how- 
ever, it is still sufficient for our purposes as we will see. The prediction value 
8 is now error corrected which results in (1001) which is 11. It is important to 
note that our proposed method does not depend on any particular method 
of error correction and we don't want to distract with unnecessary detail. 
However, some readers might find it still interesting to know that we have 
turned blocks of pixels (and in some other experiment transform coefficients) 
into bit streams by bit plane scanning. We have then used a turbo code with 
short constraint length to do the error correction. The decoded value 11 dif- 
fers significantly from the predicted value 8. This suggests that a decoding 
error has occurred. Now, the four codewords with a Hamming distance of 
one to the codeword (1001) are considered to be the most likely candidates 
for the final output. These four codewords are (0001), (1101), (1011), (1000) 
and represent the values 14, 3, 7 and 5 respectively. One simple approach to 
find a final candidate value is to take the value closest to the predicted value 
as final output value. In our example 7 is the final output value. This last 
step is an analogy to syndrome-coding [7J. 

In previous work on distributed video coding a popular method to con- 
struct the final output value is basically thresholding: if the predicted value 
and the (preliminarily) decoded value differ too much then the final output 
value is set to be the predicted value. So, the final output value for our exam- 
ple would only be 11. It is worthwhile to note that the actual reconstruction 
used in systems as e.g. [4] or [2] are more sophisticated: the reconstruction 
makes use of dithering and the boundaries of quantization bins for example 
and the final reconstruction value would be 8 or 9 depending on the actual 
implementation. 



3.2 Illustration 

Example video frames which illustrate the advantage of our counting codes 
are shown in Figure [U Depicted on the left hand side are video frames which 
were first predicted and then error corrected with a turbo code. The video 
frames on the right hand side were produced by employing our counting code 
technique. One must assume that the error correction occasionally fails on all 
bit planes. Clearly visible (in printouts of those video frames) are only black 
and white spots in the left video frames where the error correction failed on 
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significant bit planes. For example, when predicting a flat area (e.g. the pave- 
ment in the pedestrian video) even an excellent prediction, when bit plane 
wise scanned, might appear as an error burst to an error correction method 
such that it will fail. Of course, the white and black artifacts can easily be 
removed by reconstruction methods similar to thresholding, too. However, 
our method further corrects less obvious errors which greatly contributes to 
the PSNR but which are not visible in printouts of the video frames. Fur- 
thermore, it is interesting to know that we have employed the (7,5) turbo 
code with a block length of approximately 1KB. This is known to perform 
very poorly but with our new technique even this simple error correction is 
good enough to achieve a very successful reconstruction. To us it seems that 
it is not important to use a sophisticated error correction method (like e.g. 
more sophisticated turbo or LDPC codes, eventually with error concealment) 
or to construct special error correction codes for a Distributed Video Coding 
System but to transform the input data which is not a complex operation. 

Figure 1: DVC compressed and reconstructed video frames; the frames on 
the left were produced with a system using the (7,5) turbo code, whereas the 
frames on the right were produced using the same system with our proposed 
counting code technique. 



4 Final Remarks and Future Work 

In Example 12.51 above it was mentioned that the irregularity in the middle of 
the code is a weakness of our code. However, in practical applications where 
the bit depth is higher than n = 4 the code is longer. Thus the irregularities 
can easily be shifted to represent pixel values which are less important if this 
is necessary. 

In Section 12.61 we have discussed the near-1 Hamming distance and near-2 
Hamming distance of our new code. We currently believe that for a larger 
bit depth n there is no need to protect adjacent pixel values with a near-1 
Hamming distance of n — 1. It might be worthwhile to find a code which 
sacrifices near-1 Hamming distance performance in favour of near-2 or even 
near-3 Hamming distance performance. 
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